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Background of the Invention 
The invention relates to reagents and methods for expressing gene 
10 products in cardiac cells or precursors to cardiac cells /// vitro and /'/; vivo. 

Adult mammalian cardiomyocytes do not de-differentiate or re-enter the 
cell cycle. After being placed into cell culture, neonatal cardiomyocytes soon lose 
their ability to proliferate. Several cell lines, including PI 9 teratocarcinoma cells, 
embryonic stem (ES) cells, AT-1, H9c2, QCE-6, or lOTl/2 cells, have some 
1 5 molecular characteristics of cardiomyocytes. These cells are very difficult to 

manipulate, however, or are lacking important characteristics of cardiomyocytes. 
Because of these reasons, cultured neonatal cardiomyocytes from rats or mice are 
often used in in vitro systems, even though these cells are difficult to transfect 
(usually less than 0.1% transfection rate) and require long preparation procedures. 
20 Recently, it has been demonstrated that immortalized cardiac myogenic 

(CMG) cells can be differentiated from mouse bone marrow stromal cells. This is 
evidence of the generation of cardiomyocytes from a tissue of extra-cardiac origin. 

The possibility of bone marrow being an /'// vivo source of circulating 
cardiomyocytc progenitors has been previously suggested. A distribution of 



transplanted bone marrow-derived cells in a dystrophic mouse heart has been 
observed. Although the molecular characteristics of these cells were not 
identitled, their location in the heart tissue indicated these cells were 
cardiomyocytcs. Taken together, it appears that bone marrow stromal cells are an 
5 extra-cardiac source of cardiomyocytcs /// vivo, and /// vitro induction of beating 
cardiomyocytcs from a heterogeneous population of bone marrow cells is possible 
by the introduction of inductive agents such as 5-a/,acytidine. 

The molecular mechanisms which guide development of cardiac cells 
(and the heart in general) in vertebrates have been the subject of intense 

10 investigation (Fishman and Chien, Cell 91: 153-156, 1997; Olson and Srivastava, 
Science 272: 671-676, 1996). In most vertebrates, the heart tissue initially 
develops as a crescent shaped mesodermal structure located anteriorly and 
laterally. This precardiac mesoderm is brought ventrally and caudally, by folding 
of the embryo, to a form a single midline heart tube with the inflow region located 

1 5 most rostrally. This heart tube undergoes looping, bringing the inflow, ventricular, 
and outflow regions of the heart into the alignment seen in the mature heart. Later, 
chamber septation occurs, valves develop in the atrioventricular (AV) junction as 
well as in the outflow tract, and the outflow tract itself is divided into two great 
vessels. Additional refinements occur with the development of the coronary 

20 arteries and the cardiac conduction system. 

Heart development is governed by complex signals including inductive 
and positional signals from adjacent structures, as well as signals from a number of 
transcription factors (Fishman and Chien, Cell 91: 153-156, 1997; Lyons, Curr. 
Opin. Genet. Dev. 6: 454-460, 1996; Mohun and Sparrow, Curr. Opin. Genet. Dev. 

25 7: 628-633, 1997; Olson and Srivastava, Science 272: 671-676, 1996). Since 



transcriptional factors have the abihty to activate multiple genes, they are generally 
considered important regulators of organ development. A number of cardiac 
transcription factors have been identified that ha\e important influences on the 
early stages of specification and differentiation of the cardiac mesoderm (Tanaka 
et al., Dev. Genet. 22: 239-249, 1998). C's\/Nk\2.5 (Komuro and l/umo, Proc. 
Natl. Acad. Sci. USA 90: 8145-8149, 1993; Lints etal.. Development 1 19: 419- 
431, 1993), MEF-2C (Edmondson etal., Development 120: 1251-1263, 1994), 
GATA4 (Hcikinheimo et al., Dev. Biol. 164: 361-373, 1994; Kelley et al., 
Development 1 18: 817-827, 1 993) and dHAND and eHAND are members of four 
different classes of transcriptional factors all expressed in the heart at early stages 
of development. Targeted disruption of any one of these genes yields severe 
cardiac and extracardiac phenotypcs, and results in death of the embryo between 
E9.5 and El 0.5 of development. 

The mouse Csx/Nkx2.5 gene is first expressed in the cardiac progenitor 
cells at embryonic day 7.5 (E7.5), and during this stage is detected principally in 
the heart and tongue (and, to a lesser extent, in spleen, stomach, liver, and larynx). 
The extra-cardiac expression Csx/Nkx2.5 is markedly reduced afterbirth, 
however, and in the adult, the expression is confined to the heart. Mice in which 
the Csx/Nkx.2-5 gene has been deleted have no functional heart, causing an 
embryonic lethality by E9.5-1 1.5. 

Most tissue-specific gene expression is controlled by enhancer and 
repressor sequences at the transcriptional level. Cienerally, to confer tightly- 
regulated expression, enhancers adopt complex regulatory mechanisms that require 
the collaboration of multiple transcription factors. The binding sites 
for these transcription factors may be many kilobases (kb) from the gene promoter 



and dispersed relative to each other. 

It is desirable to be able to express genes in a cardiac cell-speeific 
manner. This would be useful, for example, for the targeted expression of genes 
encoding therapeutic proteins for the treatment of damaged heart tissue. 
5 Moreover, to maximize the utility of stem cell-derived cardiomyocytes, for 

example, in the treatment of damaged heart tissue in humans and other animals, it 
is desirable to be able to rapidly purify cardiac cells from a potentially 
heterogenous cell population. 

Accordingly, there is a need for the development of reagents and 
10 methods for achieving cardiac cell-specific gene expression. The present 
invention provides these reagents and methods. 

Summary of the Invention 
In a first aspect, the invention features a substantially purified nucleic 
acid molecule comprising an enhancer element having: (a) 100% identity to 40 
1 5 contiguous nucleotides of the nucleic acid molecule shown in SEQ ID NO.: 1 or 
SEQ ID NO.: 3; (b) at least 91% identity to 50 contiguous nucleotides of the 
nucleic acid molecule shown in SEQ ID NO.: 2; (c) at least 97% identity to 60 
contiguous nucleotides of the nucleic acid molecule shown in SEQ ID NO.: 1 or 
SEQ ID NO.: 3; or (d) at least 95% identity to 70 contiguous nucleotides of the 
20 nucleic acid molecule shown in SEQ ID NO.: 1 or SEQ ID NO.: 3. 

In a second related aspect, the invention features a substantially purified 
nucleic acid molecule comprising a cardiac-specific enhancer element derived 
from a human, wherein the enhancer clement has at least 60% identity to 50 
contiguous nucleotides of the nucleic acid molecule shown in SEQ ID NO.: 1, 



SEQ ID NO.; 2, or SEQ ID NO.: 3. Preferably, the element has at least 70% 
identity to 50 eontiguous nucleotides of the nucleic acid molecule shown in SEQ 
ID NO.: 1, SEQ ID NO.: 2, or SEQ ID NO.: 3. More preferably, the identity is at 
least 80%, and most preferably, the identity is at least 90%, when compared to 50 

5 contiguous nucleotides of the nucleic acid molecule shown in SEQ ID NO.: I, 
SEQ ID NO.: 2, or SEQ ID NO.: 3. 

Preferably, when expressed //; vivo, the enhancer element is active in all 
four cardiac chambers. The enhancer element of the first or second aspect may be 
naturally occurring, or it may be non-natural ly occurring. 

0 Preferably, the enhancer clement of the first or second aspect includes a 

binding site selected from the group consisting of MeO, dHAND, GATA, TGF-P, 
CarG, E-box, and Csx/Nkx2.5 binding sites. More preferably, the enhancer 
element includes at least two binding sites selected from this group. The enhancer 
element preferably also includes an Sp-1 binding site. 

5 In a third aspect, the invention also features a substantially purified non- 

naturally occurring nucleic acid molecule that includes at least three transcription 
factor binding sites selected from Mcf2, dHAND, GATA, TGF-P, CarG, E-box, and 
Csx/Nkx2.5 binding sites. More preferably, the nucleic acid molecule includes 
four transcription factor binding sites, and most preferably includes five 

0 transcription factor binding sites selected from the aforementioned group. 
Preferably, the nucleic acid molecule, when operably linked to a promoter, 
increases activity of the promoter by at least two-fold in a cardiac cell-specific 
manner. 

In a fourth aspect, the invention features a substantially purified nucleic 
5 acid molecule comprising an enhancer element having: (a) 100% identity to 50 



contiguous nucleotides of the nucleic acid molecule shown in SEQ ID NO.: 6; (b) 
at least 97% identity to 60 contiguous nucleotides of the nucleic acid molecule 
shown in SEQ ID NO.: 6; (c) at least 93% identity to 70 contiguous nucleotides of 
the nucleic acid molecule shown in SEQ ID NO.: 6; or (d) at least 90%) identity to 
5 100 contiguous nucleotides of the nucleic acid molecule shown in SEQ ID NO.: 6. 

In a fifth aspect, the invention features a substantially purified nucleic 
acid molecule that includes a cardiac-specific enhancer element derived from a 
human, wherein the enhancer has at least 45% identity to 50 contiguous 
nucleotides of the nucleic acid molecule shown in SEQ ID NO.: 6. Preferably, the 

10 element has at least at least 50%) identity to 50 contiguous nucleotides of the 

nucleic acid molecule shown in SEQ ID NO.: 6, more preferably, the element has 
at least 60% identity to 50 contiguous nucleotides of the nucleic acid molecule 
shown in SEQ ID NO.: 6, and most preferably, the clement has at least 75% 
identity or even 90% identity to 50 contiguous nucleotides of the nucleic acid 

1 5 molecule shown in SEQ ID NO.: 6. The element may be naturally occurring or 
non-naturally occurring. 

In a fifth aspect, the invention features a substantially purified nucleic 
acid molecule comprising 50 contiguous nucleotides that have a sequence that is 
that at least 90% identical to 50 contiguous nucleotides of the nucleic acid 

20 molecule of SEQ ID NO.: 4 or SEQ ID NO.: 5. 

In a sixth aspect, the invention features a DNA vector that includes the 
nucleic acid molecule of the first, second, third, fourth, or fifth aspects. The DNA 
vector can also have a promoter operably linked to a gene of interest. The gene of 
interest is preferably a cardiogenic gene (e.g., a gene encoding BMP2, BMP4, 

25 GATA4, dHAND, cHAND, MEF2C, IRX4, SRF, or Csx/Nkx2.5), a reporter gene 



(e.g., a gene encoding GFP, p-gal, alkaline phosphatase, chloramphenicol acetyl 
transferase, or luciferase), a gene encoding a selectable marker (e.g., a gene that 
provides resistance to neomycin, kanamycin, or hygromycin), or a gene encoding a 
therapeutic protein (e.g., a growth factor, a cytokine, an anti-apoptotic factor, a 
5 pro-apoptotic factor, or a protein that improves cardiac function or repair). 

In a seventh aspect, the invention features a method for inducing a cell 
to become a cardiac cell. The method includes (a) introducing into the cell or 
ancestor thereof a DNA vector that includes (i) the nucleic acid of the first, second, 
third, fourth, or fifth aspect; (ii) a promoter; and (iii) a cardiogenic gene operably 

10 linked to the promoter; and (b) placing the cell under conditions that result in 
expression of the cardiogenic gene operably linked to the promoter. Preferably, 
expression of the cardiogenic gene further enhances expression of cardiogenic 
genes by binding to cardiac-specific enhancer elements. 

In an eighth aspect, the invention feature a method for specifically 

15 expressing a gene in cardiac cells, said method comprising introducing into the cell 
or ancestor thereof a DNA vector that includes (i) the nucleic acid of the first, 
second, third, fourth, or fifth aspect; (ii) a promoter; and (iii) the gene operably 
linked to the promoter. Preferably, the nucleic acid allows expression of the gene 
in a cardiac cell and does not express said gene in at least one cell that is not a 

20 cardiac cell. 

In a ninth aspect, the invention features a method for determining the 
efficacy of a method of inducing target cells to produce or become cardiac cells, 
the method including: (a) introducing into at least one target cell (or an ancestor of 
the target cell) a DNA vector that includes (i) the nucleic acid of the first, second, 

25 third, fourth, or fifth aspect; (ii) a promoter; and (iii) a reporter gene operably 



linked to the promoter; (b) performing a method tor potentially inducing the target 
cells to produce or become cardiac cells; and (c) determining the number or 
percentage of cells that are reporter gene-positi\ e, \\ herein a higher number or 
percentage indicates a higher eftlcacy of the method of inducing stem cells to 
5 produce or become cardiac cells. Preferably, the target cells are stem cells such as 
bone marrow stem cells or embryonic stem cells. 

In a tenth aspect, the invention features a method for determining the 
efficacy of a method of inducing target cells to produce or become cardiac cells, 
the method including: (a) introducing into at least one target cell (or an ancestor of 

10 the target cell) a DNA vector that includes (i) the nucleic acid of the first, second, 
third, fourth, or fifth aspect; (ii) a promoter; and (iii) a gene, encoding a selectable 
marker, operably linked to the promoter; (b) performing a method for potentially 
inducing the target cells to produce or become cardiac cells;(c) performing a drug 
selection, wherein cells expressing said gene encoding the selectable marker are 

1 5 capable of surviving in the presence of the drug and cells not expressing the gene 
encoding the selectable marker are not capable of surviving in the presence of the 
drug; and (d) deteiTnining the survival of cells following drug selection, wherein a 
higher cell survival indicates a higher efficacy of the method of inducing stem 
cells to produce or become cardiac cells. In this method, step (b) can be performed 

20 before or after step (c). Preferably, the target cells are stem cells such as bone 
marrow stem cells or embryonic stem cells. 

In an eleventh aspect, the invention features a method of identifying a 
cardiac cell, including (a) introducing into the ccli (or an ancestor of the cell) a 
DNA vector that includes (i) the nucleic acid of claim the first, second, third, 

25 fourth, or fifth aspect; (ii) a promoter; and (iii) a reporter gene operably linked to 




the promoter, whereby the cell expresses the reporter gene if the cell is a cardiac 
cell; (b) allowing sufficient time for the reporter gene to be expressed in cardiac 
cells; and (c) identifying the cardiac cells by the presence of the reporter gene. 
Preferably, the method is performed /// vitro. 
5 In a twelfth related aspect, the invention features a method of 

substantially purifying a cardiac ceil from a heterogeneous population of cells, 
including: (a) introducing into at least a subset of cells in the population (or 
ancestors of these cells) a DNA vector that includes (i) the nucleic acid of the first, 
second, third, fourth, or fifth aspect; (ii) a promoter; and (iii) a reporter gene 

1 0 operably linked to the promoter, whereby a cell expresses the reporter gene if the 
cell is a cardiac cell; (b) determining whether a cell in the heterogeneous 
population is expressing the reporter gene; and (c) if the cell is expressing the 
reporter gene, purifying the cell from the heterogeneous population. 

In a thirteenth aspect, the invention features a method of expressing a 

1 5 gene encoding a therapeutic protein in a cardiac cell. The method includes 

introducing to the cell (or an ancestor of the cell) a DNA vector that includes (i) 
the nucleic acid of the first, second, third, fourth, or fifth aspect; (ii) a promoter; 
and (iii) a gene encoding a therapeutic protein operably linked to the promoter 
such that the gene is expressed in cardiac cells. The cell may be a cardiac cell, or 

20 it may be a cell that is capable of differentiating as a cardiac cell. The method may 
be performed //; vivo or in vitro. If the method is performed /// vitro, the cell (or a 
descendent of the ceil) can be grafted into a patient. 

In all of the foregoing aspects of the invention, an enhancer element is 
defined as a nucleic acid sequence that when present, (i) increases in a cardiac cell 

25 expression of a gene to which it is operably linked by at least 25%, (ii) allows for 
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gene expression in the heart tube prior to looping, (iii) allows for gene expression 
in all four heart chambers; or (iv) increases cardiac expression 100% more than it 
increases extracardiac expression. Preferably, in (i), the expression is increased by 
at least 50%, more preferably by lOO'X), and most preferably by 200%) 
5 As used herein, by "nucleic acid" is meant either DNA or RNA. A 

"nucleic acid molecule" may be a single-stranded or double-stranded polymer of 
deoxyribonucleotide or ribonucleotide bases. Unless otherwise specified, the left 
hand direction of the sequence of a single-stranded nucleic acid molecule is the 5' 
end, and the left hand direction of double-stranded nucleic molecule is referred to 

10 as the 5' direction. 

By "promoter" is meant a region of nucleic acid, upstream from a 
translational start codon, which is involved in recognition and binding of RNA 
polymerase and other proteins to initiate transcription. A "human promoter" is a 
promoter capable of initiating transcription in a human cell, and may or may not be 

1 5 derived from a human cell. A "Cv.\-/Ma'2. J promoter" is one derived from the 
promoter region of a Csx/Nkx2.5 gene and that, when opcrably linked to a 
heterologous nucleic acid molecule, is capable of initiating transcription of that 
molecule (when present in a transcription medium capable of supporting 
transcription) in a cardiac cell. 

20 By "enhancer element" is meant a nucleic acid sequence that, when 

positioned proximate to a promoter and present in a transcription medium capable 
of supporting transcription, confers increased transcription activity relative to the 
transcription activity resulting from the promoter in the absence of the enhancer 
domain. 

25 By "opcrably linked" is meant that two or more nucleic acid molecules 
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(e.g.. a nucleic acid molecule to be transcribed, a promoter, and an enhancer 
element) are connected in such a way as to permit transcription of the nucleic acid 
molecule in a suitable transcription medium. 

By "derived from" is meant that a the nucleic acid molecule was either 
5 made or designed from a second nucleic acid molecule, the derivative retaining 
important functional features of the nucleic acid molecule from which it was made 
or designed. 

By "expression construct" is meant a nucleic acid molecule that is 
capable of directing transcription. An expression construct of the present 
10 invention includes, at the least, a cardiac-specific enhancer clement and a 

promoter. Additional elements, such as a transcription termination signal, may 
also be included, as described herein. 

By "vector" or "expression vector" is meant an expression system, a 
nucleic acid-based vehicle, a nucleic acid molecule adapted for nucleic acid 
1 5 delivery, or an autonomous self-replicating circular DNA (e.g., a plasmid). When 
a vector is maintained in a host cell, the vector can cither be stably replicated by 
the cells during mitosis as an autonomous structure, incorporated within the 
genome of the host cell, or maintained in the host cell's nucleus or cytoplasm. 
By "cardiac cell" is meant a differentiated cardiac cell (e.g., a 
20 cardiomyocyte) or a cell committed to producing or differentiating as a cardiac cell 
(e.g., a cardiomyoblast or a cardiomyogenic cell). 

By "cardiac-specific enhancer element" is meant an element, operably 
linked to a promoter, that directs gene expression in a cardiac cell and does not 
direct gene expression in all tissues or all cell types. Cardiac-specific enhancers of 
25 the present invention may be naturally occurring or non-naturally occurring. One 
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skilled in the art will recogni/.e that the synthesis of non-naturally occurring 
enhancers can be performed using standard oligonucleotide synthesis techniques. 

By "plasmid" is meant an autonomous DNA molecule capable of 
replication in a cell, and includes both plasmids designed for expression and 
5 plasmids designed for nucleic acid replication. 

By "heterologous" is meant that the nucleic acid molecule originates 
from a foreign source or, if from the same source, is modified from its original 
form. Thus, a "heterologous promoter" is a promoter not normally associated with 
the duplicated enhancer domain of the present invention. Similarly, a heterologous 
10 nucleic acid molecule that is modified from its original form or is from a source 
different from the source from which the promoter to which it is operably linked 
was derived. 

By "substantially pure nucleic acid" is meant nucleic acid that is free of 
the genes which, in the naturally-occurring genome of the organism from which 

1 5 the nucleic acid of the invention is derived, flank the nucleic acid. The term 

therefore includes, for example, a recombinant nucleic acid which is incorporated 
into a vector; into an autonomously replicating plasmid or virus; or into the 
genomic nucleic acid of a prokaryotc or a eukaryote cell; or which exists as a 
separate molecule (e.g., a cDNA or a genomic or cDNA fragment produced by 

20 PCR or restriction endonuclease digestion) independent of other sequences. It also 
includes a recombinant nucleic acid which is part of a hybrid gene encoding 
additional polypeptide sequence. 

By "transgene" is meant any piece of a nucleic acid molecule (for 
example, DNA) which is inserted by artifice into a cell either transiently or 

25 permanently, and becomes part of the organism if integrated into the genome or 
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maintained extrachromosomally. Such a transgene may include a gene which is 
partly or entirely heterologous (i.e., foreign) to the transgenic organism, or may 
represent a gene homologous to an endogenous gene of the organism. 

By "transgenic cell" is meant a cell containing a transgene. For 
5 example, a stem cell transformed with a vector containing the expression vector of 
the present invention operably linked to a heterologous nucleic acid molecule can 
be used to produce a population of cells having altered phenotypie characteristics. 

Other features and advantages of the invention will be apparent from the 
following description of the preferred embodiments thereof. 

10 Brief Description of the Drawings 

Fig. I is a schematic illustration showing the structure of the mouse 
Csx/Nkx2.5 {mCsx/NLx2.5) locus constructs utilized for transgenic experiments. 
Open boxes and shaded boxes represent untranslated sequences and coding 
sequence, respectively. Various subfragments of these two X phage clones were 
1 5 utilized for construction of transgencs (CsxlacZ- 1 through CsxlacZ-6). 

Fig. 2 is a schematic illustration showing a representation of lacZ 
expression pattern seen in the heart with the inCs.\/Nk.x2.5 constructs tested. 

Fig. 3 A is an alignment of human Csx/Nkx2.5 {liCsx/Nkx2.5) and 
iuCsx/NL\2.5 and sequence approximately 9 kb 5' to the ATG of exon 1 . Shaded 
20 area indicate the location of homology domains Al (top) and A2. The location of 
transcription factor binding sites is also indicated. 

Fig. 3B (top sequence) is an alignment of i)iCsx/Nkx2.5 and hCsx/Nkx2.5 
sequence approximately 3 kb 5' to the ATG of exon 1 . Within this sequence is 
homology domain B (SEQ ID NO.: 6). 
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l-'ig. 3C is a schematic illustration showing the genomic organization of 
mC.s\/j\'k.\2.3 and liCs.\/Nk.\2.5 . Homology domanis A-D are indicated. Note that 
in homology domain A, the human sequence is interrupted by sequence that has no 
identity to the niCsx/Nk.\2.5 sequence. 
5 Fig. 4A shows the nucleotide sequence for SHQ ID NO.; 4, -7.5 kb of 

hCs.x/Nk.x2.5 genomic sequence, the genomic location of which is shown on Fig. 6. 

Fig. 4B shows the nucleotide sequence for SEQ ID NO.: 5, ~6.8 kb of 
hCsx/Nk\2.5 genomic sequence, the genomic location of which is shown on Fig. 6. 

Fig. 5 A shows the nucleotide sequence of niCsx/NL\2.5 homology 
10 domains Al (SEQ ID NO.: 1 ) and A2 (SEQ ID NO.: 2). 

Fig. 5B shows the nucleotide sequence of /}Cs.\/Nk.\-2.5 homology 
domains Al and A2 and the intervening sequence that has no counterpart in the 
niCs.x/Nkx2.5. 

Fig. 5C shows the nucleotide sequence of /tCs.x/Nk.x2.5 homology 
1 5 domains B (SEQ ID NO.: 6). 

Fig. 6 is a schematic illustration showing the transgenic constructs of the 
hCsx/Nkx2.5 enhancer elements that were tested for cardiac-specific expression. 
The locations of SEQ ID NO.: 4 and SEQ ID NO.: 5 are also shown. 

Fig. 7 is a table summarizing the transgenic analysis of the hCsx/Nkx2.5 
20 enhancer elements. The constructs correspond to those shown in Fig. 6. 

Fig. 8 is a series of photographs showing transgene expression in mice 
that resulted from the indicated cnhancer-hsp68-/c/cZ constructs. 

Fig. 9 is a series of photographs showing transgene expression in mice 
that resulted from the 7.5 kb enhanccr-hspCS-Zr/rZ constructs. 
25 Fig. 10 is a schematic illustration showing various uses of cardiac- 
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specific enhancer elements for facilitating the production or purification of cardiac 
cells. 

Detailed Description of" the hncntion 
5 We have cloned cardiac enhancers from /iC's.\/Nk.\2.5 and niCs.\/Nk.x2.5 

that, when used to drive transgene expression in mice, recapitulate expression 
patterns of the endogenous mCs.x/Nk.\2.5. One of the hCs.x/Nk.\2.5 enhancer 
constructs, 7.5 kh-\\s,p-lacZ, constructed by operably linking 7.5 kb of genomic 
region from hCsx/Nkx2.5 (SEQ ID NO.: 4) to an hsp68 promoter-/acZ cassette, is 

10 active as early as E7.5, the earliest time point of cardiac development, and 

recapitulates the expression pattern of endogenous mCsx/Nk\2.5 at E9.5, El 0.5, 
and El 2.5. Thus, when the four chambei^s of the heart are formed at El 2.5, the 
enhancer is active in all four chambers. Among the mammalian cardiac enhancers 
known so far, this 7.5 kb enhancer is the earliest enhancer that is active in all four 

15 chambers. Moreover, this enhancer displays no ectopic expression. 

Within this 7.5 kb fragment, we hav e further isolated two regions 
(referred to herein as homology domain Al (HDAl; SEQ ID NO.: 1) and 
homology domain A2 (HDA2; SEQ ID NO.: 2) that together, when operably 
linked to an hsp68 promoter-Zr/c Z cassette, are capable of enhancing gene 

20 expression in a cardiac-specific manner. 

Prior art methods of OMG cell isolation are time consuming and 
produce low cell yields. Generally, cardiomyocytes are detected by their beating. 
The proportion of beating cardiomyocytes from primary bone marrow stem cells is 
very low, and even after immortalized CMCj cells are established, approximately 

25 70% of cells derived from them arc not beating. The failure to achieve a high 



percentage of beating cardiomyocytcs may be due to the incomplete induction of 
the cardiomyocytc phenotypc by 5-a/acytidine. Due to the extended time period 
currently required to isolate CMG cells, it was previously difficult to improve 
cardiomyocytc induction methods. Beating is one of the last phenotypes to be 
5 exhibited by cardiomyocytcs. The present invention pro\ idcs an earlier marker of 
cardiomyocytc induction. This is useful, for example, for optimizing 
cardiomyocytc induction methods. By reducing the length of time required for 
testing, one will be able to more rapidly improve the efficacy of induction of the 
desired cells. The present invention also provides methods for enriching for 
10 cardiomyocytcs by, for example, sorting based on expression of a reporter gene 
operably linked to a Csx/NL\2.5 cardiac cell-specitlc enhancer. These and other 
uses are described in greater detail in the examples. 



Example 1 : Characterization niCsx/NL\2.5 enhancer elements 

As an initial step in defining the regulatory region of Csx/Nkx2.5 , we 

1 3 determined the 5' end of the mCsx/Nkx2.5 transcript by 5' RACE with total RNA 
from hearts of three-week-old mice. Three different transcripts which were 
identical for most of their length, except for distinct 5' ends, were identified 
(transcripts I, II, and III). The sequence comparison with the genomic DNA 
sequence showed exon - intron boundaries in accordance with the consensus exon 

20 - intron junction sequences. Therefore, we named the most 5' cDNA segment exon 
la, the most 3' cDNA segment exon lb, and the intervening genomic sequence 
intron 1 . Likewise, the exon that contains the start codon is termed exon Ic, 
flanked by introns 2 and 3, and followed by the last exon. exon 2, which includes 
the homeodomain (HD) and TAG translation stop codon. 
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Transcript I contains two segments of cDNA at its most 5' end which are 
discontinuous in the genomic sequence. The most 5' segment of transcript I is 46 
bp in length (exon la) and is 669 bp upstream from the second discontinuous 
segment, fhe second discontinuous segment is 36 bp long (a part of exon lb) and 

5 3504 bp upstream from the translation start codon. for transcript 11, exon lb is the 
most 5' sequence and is extended more 5' than in transcript 1, in this case 93 bp 
long, including the 36 bp internal fragment found in transcript 1. In transcript III, 
the first exon at the 5' end, exon Ic, includes 71 bp of continuous genomic 
sequence which is spliced out in transcripts I and II. 

t) To confirm the existence of the novel cxons, we performed primer 

extension analysis. With adult mouse heart poly(A) RNA, two bands were 
observed which correspond to the sizes of transcripts II and 111. The band 
representing transcript III was much stronger, suggesting that it is the major site of 
initiation. Since we could not detect a band corresponding to the transcript I by 

5 primer extension analysis, we performed RT-PCR using total RNA samples from 
several tissues. With exon la-specific and exon Ic-spccitlc primer pairs, RT-PCR 
generated products of the predicted size (112 bp) from the heart and spleen. No 
band was amplified with RNA from kidney, which does not express Csx/Nkx2.5. 
By Southern blotting, this 1 12 bp band was shown to include exon lb using an 

0 exon Ib-specific oligonucleotide probe. 

To further characterize the genomic structure of the inCsx/Nkx2.5 gene, 
we subcloned and sequenced 8.6 kb of genomic DNA. The 3' end of exon la is 
located 4265 bp upstream from the ATG codon, followed by intron 1 (668 bp), 
exon lb (93 bp), intron 2 (3179 bp), exon Ic (655 bp) that includes the ATG 

5 translation start site, intron 3 (1377 bp), and exon 2 (983 bp) with TAG stop codon 
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and a poly(A) addition signal. The genomic sequences around tlie three putative 
transcription initiation sites siiow no I A l A box, indicating that inC.s.\-/Nk.\2.5 is a 
TATA-less gene. 

3.3 cind 4 kh of 5' Jhinkiiif^ sequence drive IdcZ expression in flic outflow 
tract cind based portion of the rii^lit ventricle as well as in the pharynx, 
thyroid priinordiuni, and stomach 

Since the major /^/Cv.v/M.vi.i transcript (transcript III) starts 325 bp 
upstream from the ATG codon, we searched for cnlianccr elements in the genomic 
region immediately upstream to exon Ic. The 3.3 kb of 5' flanking sequence 
between the Not\ site and the ATG codon was fused to the lacZ reporter gene 
(CsxlacZ-1 ; Fig. 1 ) and injected into mouse embryos to generate transgenic mice. 
Of 40 embryos examined, eight embryos carried the transgene and three were 
noted to have P-gal staining. When the /^cZ expression was assessed in these 
embryos at El 0.5, we found that this 3.3 kb upstream region could drive lacZ 
expression in cardiac and cxtracardiac tissues. Myocardial cells in the outflow 
tract and the basal part of the right ventricle were strongly positive for lacZ, and 
there were also a few positive cells in the trabecular layer of the right ventricle. In 
the outflow tract, lacZ was strongly expressed in the myocardium, but no lacZ 
expression was detected in the aortic sac or endocardial cushions. 

At EI0.5, cxtracardiac expression of /r/rZ was also observed in locations 
where the endogenous niCsx/Nkx2.5 was detected, such as the pharyngeal floor, 
thyroid primordium, and in the distal part of the stomach. In the stomach, lacZ 
expression was observed in mesenchymal cells. Ectopic expression of lacZ was 
observed in the surface ectoderm of pharyngeal arches and the laryngotracheal 



groove. The lacZ expression pattern in both cardiae and extraeardiac tissues was 
substantially identical in all three /^/rZ-positi\ e embryos, with the exception of an 
ectopic expression in the glossopharyngeal ganglions in one embryo. 

The C'sxlacZ-2 construct included the same 3.3 kb of 5' flanking 
sequence plus 1 .4 kb of intron 3 sequence (Fig. 1 ). Of 34 embryos, six carried the 
C'sxlacZ-2 transgcne and, of these, three had positive P-gal staining. Cardiac 
expression was seen in the basal portion of the right ventricle and in the outflow 
tract, and extraeardiac expression was seen m the pharyngeal floor, the thyroid 
primordium, and the stomach in two embryos. In one embryo, /c/cZ expression 
was noted in the pharyngeal floor but not in other sites. This lack of expression 
was probably due to a negative effect of the transgene integration site. 

The addition of most of intron 1 and exon 1 b to the CsxIacZ- 1 construct 
yielded CsxlacZ-3 (Fig. 1 ). Two embryos were positive for CsxlacZ-3 transgene, 
with one embryo having positive P-gal staining. The lacZ expression was observed 
in the outflow tract, right ventricle, pharynx, thyroid primordium, and stomach. 
Thus, the cardiac P-gal staining pattern was quite similar for constructs CsxlacZ-1, 
C'sxlacZ-2, and CsxlacZ-3. The extraeardiac expression pattern was also very 
similar except for the sites of ectopic expression. 

CsxlacZ-4 contains 6 kb of 5' flanking sequence. Twenty-two injected 
embryos were analyzed, and ten found to carry the C'sxlacZ-4 transgene. 
Interestingly, none of the CsxlacZ-4 transgenic embryos showed any P-gal staining 
in the heart nor in any extraeardiac tissues. This finding indicates that a strong 
negative regulatory clement exists between 6 kb and 4 kb 5' to exon Ic (Fig. 2). 



A cis-re^ulatory clement for expression oj inCsx/Nkx2.5 in the medial 
Weill and inner traheeiilae of the rii^lil and left ventricles exists between 
14 and 6 kh of 5' flankinii sequence. 

With 14 kb of 5' flanking sequence dri\ ing the reporter gene (CsxlacZ- 
5 5), lacZ was expressed more widely in the heart than with CsxlacZ-1. P-gal staining 
was observed in the entire right and left ventricles, except for the compact layer of 
the lateral walls. In the atrium, there was a clustei" of /c/rZ-positive myocardial 
cells in the interatrial groove, which is above the future atrial septum. P-gal staining 
was also seen in the outflow tract as in CsxlacZ-1 . Cardiac staining was similar in 
10 all four transgenic embryos with positive P-gal staining. 

Extracardiac expression of CsxlacZ-S was observed in the pharyngeal 
floor and thyroid primordium in all four C'sxlacZ-5 transgenic embryos. Only one 
of the four CsxlacZ-5 embryos was noted to have weak P-gal staining in the 
stomach, in contrast to strong P-gal staining in the stomach of transgenic embryos 
15 harboring CsxlacZ-1, -2, or -3 constructs. No ectopic expression was noted in any 
of the CsxlacZ-5 embryos, indicating the presence of repressor elements that 
restrict expression of mCsx/Nkx2. 5 in a tissue-specific manner. 

An enhancer for the right ventricle is located within 6 kh of the 3' 
f (inking sequence 

20 To examine cis-regulatory elements downstream of the coding sequence 

of mCsx/Nkx2.5, we linked 6 kb of 3' flanking sequence to the CsxlacZ-1 construct 
to create CsxlacZ-6 (Fig. 1). Of 38 embryos examined, 1 2 carried the CsxlacZ-6 
transgcne, of which 6 were /I'/fZ-positivc. fhrce transgenic embryos had P-gal 
staining in the entire right ventricle, including the compact layer of the lateral 
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walls. The other three had no eardiae staining, but did have weak cxtracardiac 
staining. Extraeardiae staining in all six embryos included the stomach, pharynx, 
and thyroid primordium. 



Different expression patterns of t/ie reporter i^ene in adult transgenic 

5 mice 

In addition to the evaluation of transient transgenic embryos described 
above, lines of transgenic mice were created and Fl transgenic mice were 
evaluated at different time points in development. 

In the transgenic C'sxlacZ-1 line, which carried the 3.3 kb 5' flanking 

10 sequnece, no P-gal staining could be observed at E7.5. At E9.5, the pattern of 

cardiac and extraeardiae /c/cZ expression was identical to that seen in the transient 
transgenic embryos. At El 5.5, /c/cZ expression was also observed in the spleen, 
but not in the tongue. In the EI 5.5 heait, lacZ expression was reduced, but 
detectable in the outflow tract region of the right ventricle, while the transgene 

1 5 expression in the thyroid gland was still strong. Interestingly, when adult animals 
of the same transgenic CsxlacZ-I line were analyzed, P-gal staining was completely 
absent in the hearts. 

In the transgenic CsxlacZ-5 line, which earned the 14 kb 5' flanking 
sequence, P-gal staining was observed in the cardiac crescent at E7.5, in the 

20 common ventricle and outflow tract at E8.25, and in the right ventricle, left 

ventricle, septum, and outflow tract at E9.5. At El 5.5, the cardiac P-gal staining 
was significantly down-regulated, except for patchy expression in the A V junction 
and in the interventricular septum. The spleen was also stained, but the tongue 
was not at EI 5.5. Analysis of adult C'sxlacZ-5 transgenic mice showed patchy P-gal 
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staining along the luminal surface of the right ventricle, particularly along the 
ventricular septal surface. P-gal staining was also seen in a small part of the left 
ventricle base and in the AV junction area. 



Aiirorcgtilafioii ofmCsx/Nk.x2. 5 expression 
5 One of the mechanisms to ensure tissue-specific expression of a 

transcription factor is a positive autoregulation of its own promoter, as has been 
shown for the MyoD gene. The inCsx/Nkx2.5 promoter contains multiple NKE 
elements, the binding sites for Csx/Nkx2.5 protein (Chen and Schwartz, J. Biol. 
Chem. 270:15628-15633, 1995). In order to determine whether positive 

10 autoregulation of mCsx/NLx2.5 is present, we examined lacZ expression in mice 
homozygous null mutant for mCsx/Nkx2.5 created by homologous recombination. 
Since lacZ expression is under the control of 5' and 3' regulatory regions of 
mCsx/Nkx2.5 in the mutant embryos, P-gal staining will be weaker or absent in 
homozygous null embryos if positive regulation by mCsx/Nkx2.5 is present. 

1 5 Unexpectedly, P-gal staining was far stronger in the homozygous mutant embryos 
than in the heterozygous mutant embryos stained simultaneously. These data 
indicate that a negative feedback, either directly or indirectly, of mCsx/Nkx2.5 
exists, since the intensity of P-gal staining in the homozygous mutant heart was 
much more than double that in the heterozygous mutant heart. To confirm this 

20 result, we performed semi-quantitative RT-PCR using RNA extracted from the 
heart of heterozygous and homozygous mutant embryos at E9.5. Transcripts for 
lacZ in homozygous mutant hearts were approximately 8-fold those in 
heterozygous mutant hearts, after adjustment using the relative abundance of the 
transcript for a-cardiac actin. 



The foregoing results were achieved with the following materials and 

methods. 

Isolafiofi of genomic clones and sequencing 

Two lambda phage genomic DNA clones containing the mCsx/Nkx2.5 
5 coding and flanking sequences were isolated from a 129 mouse genomic library in 
ADASH II. Clone 1 contained 14 kb of 5' flanking region in addition to a portion of 
the mCsx/NLx2.5 coding region. Clone 2 contained 6 kb of 5' flanking region, the 
iuCsx/Nkx2.5 coding region, and 6 kb of 3' flanking region. In addition, a mouse 
PI clone (Genome Systems, St. Louis, MO) which contained the mCsx/Nkx2.5 
10 coding sequence, as well as more than 10 kb 5' and 3' flanking sequence, was 

isolated. Restriction enzyme mapping and serial Southern blotting were performed 
using standard techniques. 

Definition of 5' end ofniRNA 

5' RACE was performed following previously described methods with 
15 some modifications (Reecy et al., Dev. Biol. 188: 295-31 1, 1997). The first strand 
cDNA synthesis reaction was performed using 5 |j.g total RNA from the heart of 
three- week old mice and 100 ng of random primers (Promega, Madison, WI) in a 
40 III volume containing 50 mM Tris-HCl (pH 8.3), 75 mM KCl, 3 mM MgClj, 10 
|iM DTT, 40 U RNase inhibitor (Promega), 0.5 mM each deoxynucleotide, 200 U 
20 M-MLV reverse transcriptase (Life Technologies, Gaithersburg, MD) for 90 

minutes at 37"C. The cDNA was purified using a QIAQUICK™ PCR purification 
column (QIAGEN, Santa Clara, CA) after incubation with 5 |ig RNase A 
(Ambion, Austin, TX). Ligation-anchorcd PCR was performed as described by 
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Ali Aiisari-Lari et al. (BioTcchniques 21 : 34-36, 1996) with some modifications. 
Initially, 3 nmoles of a 3' amino-modificd 5' phosphorylatcd anchor primer (5'- 
TC V CTA CTC CGA ATT CX G TCG TC C AGA C GT-3'; SEQ ID NO: 7) was 
end ligated to one half of the purified first strand cDNA using 20 U of T4 RNA 
5 hgasc (New England Biolabs, Beverly, MA) in a 50 |.iL reaction volume 

containing 50 mM Tris-HCl (pH 8.0), 10 mM MgCf , 1 niM hcxamine cobalt 
chloride, 20 |iM ATP, 25% PEG 800 and incubated for 24 hours at 16"C. The 
anchor-ligated cDNA was further purified using a QIAQUIGK PGR purification 
column. The first round PGR was performed using one-tenth of the purified 

1 0 anchor-ligated cDNA, an anchor-specific primer (5'-AGG TGT GGA CGA CGG 
AAT TCG GAG TAG AGA-3'; SEQ ID NO: 8) and a C.s.v/yV^vi.J-specific primer 
(5'-GGG GGC GGC TGG GAA AGC AGG AGA GCA GTT-3'; SEQ ID NO: 9). 
PGR conditions were as follows: denaturation at 94"C for one minute, annealing at 
60"C for one minute, extension at 72"C for two minutes with an additional five 

1 5 minutes incubation at 72"C at the end of the 30 cycles. Subsequent PGR reactions 
were performed using 5 )liI of each PGR product, a nested anchor primer (5'-CGA 
CGG AAR TCG GAG TAG AGA-3'; SEQ ID NO: 10), and one of the 
CWA^Ax2.5-specific primers (5*-TTG A AG GCG GGC AGC ATG GAG GAG 
GCA-3' (SEQ ID NO: 1 1) or 5'-ACA GGA GCG AGG GGC AGT TGT GCG T-3' 

20 (SEQ ID NO: 12)) at the same PGR conditions as above. The PGR products were 
visualized on a 2% agarose gel with ethidium bromide, and subsequently 
subcloncd and sequenced. 
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Detection of exons lb and Ic by primer extension 
Five microgram samples of mouse heart poly(A) RNA were 
coprecipitatcd with 2 x 10"* cpm end-labeled, gel-puritled oligonucleotide probe 
(5'-CGG AGC ACC AGG GGC AG A AG A GGG-3'; SEQ ID NO: 13). The 
5 samples were rcsuspended in 10 |iil annealing buffer (0.1 M NaCl, 0.01 M, pH 8.0, 
0.001 M EDTA), heated to 85"C for five minutes, the incubated at 37"C four two 
hours. Twenty microliters of reverse transcription buffer ( 10 mM DTT, 16 mM 
MgCI2, 1 mM dNTP, 1 U/^1 RNasin, 0.1 M Tris, pH 8.0) was added to each 
sample as well as 20 U Superscript™ reverse transcriptase (Life Technologies). 
10 Reverse transcription was allowed to proceed at 4()"C for one hour, and 7.5 \i\ of 
each sample was mixed with an equal volume of formamide loading buffer. After 
denaturation at 95"C for five minutes, 7 |iil of each sample was loaded onto an 8% 
denaturing polyacrylamide gel. After electropheresis, the gel was dried and 
exposed to film. 



1 5 Detection ofexon la by RT-PCR 

RT-PCR was performed using rTth DNA polymerase (rTth RT-RNA 
PCR kit; Perkin Elmer, Branchburg, NJ) following the manufacturer's protocol. 
Reverse transcription was performed using an exon Ic-specific primer (5'-ACA 
GGA GCG ACG GGC AGT TCT GCG T-3'; SEQ ID NO: 14), and the subsequent 

20 PCR reaction was performed by adding an exon la-specific primer (5'-GAG TGC 
TCT GCC TGA TGA TC-3'; SEQ ID NO: 15) to the RT reaction according to the 
manufacturer's instruction. The PCR reaction consisted of 35 cycles of 
denaturation at 95"C for ten seconds, annealing/extension at 55"C for 15 seconds, 
and, at the end of the 35 cycles, an additional final extension at 55"C for seven 
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minutes. 1 he PCR products were \ isuali/ed on a 3'X) agarose gel with ethidium 
bromide, and a Southern analysis was performed using a --P end-labeled exon Ib- 
specitlc primer (5'-CCA GTC TAG AAG GG(i I GA TCG GCA-3'; SEQ ID NO: 
16). 

5 Construction of reporter i^ene constructs, {^enerotlon cuul analysis of 

transgenic mice 

An Xha\-Pst\ lacZ cassette from pnlacF (Bonnerot et al., Proc. Natl. 
Acad. Sci. USA 84: 6795-6799, 1987) was subcloned into the Xha\ and Pstl sites 
of pBluescript SK- (Stratagene) and regions of the genomic tnCsx/Nkx2.5 DNA 

1 0 were cloned 5' or 3' to the lacZ cassette. The GsxlacZ- 1 construct included 3.3 kb 
of genomic sequence, including most of intron 2 and the beginning of exon Ic, 
between the Not\ site and the ATG codon (Fig. 1 ). An Xho\-Pst\ fragment 
containing part of exon Ic, intron 3, and part of exon 2 was subcloned 3' to the 
laeZ gene in CsxlacZ-1 to make the construct GsxlacZ-2 (Fig. 1 ). The CsxlacZ-3 

1 3 construct contains a 4 kb fragment between the Spe\ site and the ATG codon of 
Csx/Nkx2.5, thus including a part of intron 1, exon lb, intron 2, and part of exon 
Ic (Fig. 1 ). CsxlacZ-4 and CsxlacZ-5 contain 6 kb and 14 kb, respectively, of 
upstream 5' Csx/Nkx2.5 genomic sequence cloned upstream of the lacZ gene. A 6 
kb long 3' downstream genomic fragment was fused 3' to the lacZ gene in 

20 CsxlacZ-1 to create the construct CsxlacZ-6 (Fig. 1). 

For microinjection of CsxlacZ-1, -2. -3, and -6, inCsx/Nhc2.5 genomic 
sequence together with the /(T/c Z cassette portion of each construct was excised 
from the vector \\s\ng Xho\ and Not\ and purified by gel electrophoresis and 
Geneclean III (BiolOl, Vista, GA). In preparation for microinjection, CsxlacZ-4 
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and C'sxlacZ-5 were linearized with Xho\, then purified as indicated above. The 
creation of transgenic mice was done using standard methods (Hogan et al. (1994) 
Manipulating the Mouse Embryo . C\^!d Spring Harbor Laboratory Press, New 
York). 

5 For transient transgenic analysis, F„ embryos were dissected at El 0.5. 

Subsequent genotyping was performed using PC'R on chromosomal DNA isolated 
from the yolk sacs. PCR primer pairs used for detection of the transgenes were as 
tbllows: 5'-CCG TCC GAT GAA AAA GAG GAG-3' (SEQ ID NO: 17) and 5'- 
TC T GGT CTT CGT TGG CTG ATG-3' (SEQ ID NO: 1 8) for CsxlacZ-1, -2, and 

10 -3; 5'-GGG TGC GAT GAA AAA C^AG CiAG-3' (SEQ ID NO: 17) and 5'-TTA 
AGT TGG GTA ACG CCA GGG-3' (SEQ ID NO: 19) for CsxlacZ-4 and -5; and 
5'-AAC TTG CTA GGT AGA CTA GGC TGG C-3' (SEQ ID NO: 20) and 5'-TCT 
GCT CTT CGT TGG CTG ATG-3' (SEQ ID NO: 1 8) for CsxlacZ-6. 

Whole-mount p-gal staining was performed according to the method of 

15 Schlaegcretal. (Development 121: 1089-1098, 1995). After photographs were 
taken of the whole mounts, embryos were dehydrated through graded ethanol and 
xylene, embedded in paraffin, sectioned, and counterstained with Nuclear Fast Red 
(Vector Laboratories, Burlingame, CA). 

(leiieratio/i of Dioiisc lines earn ing CsxlacZ-l and CsxlacZ-5 
20 For CsxlacZ-1 and CsxlacZ-5, we established transgenic lines of mice. 

The F,, mice carrying each transgene were backcrosscd with FVB mice and F, 
embryos w ere examined for lacZ expression. We identified one line of transgenic 
mice for each construct and analyzed them at different time points, including E7.5, 
E8.25. E9.5. EI 5,5, and adult. 



(iciieration of Cs.x/Nkx2 .5 knock-out din/ lacZ knock-in mice 

The entire coding region inCsx/Nk.\2.5 was replaced with a lacZ- 

ncomycin resistance gene {Nco) cassette by homologous recombination in a 

standard manner (Hogan et al., supi-(i). 



5 /// situ hybridization 

In situ hybridization was performed as described previously (Tanaka et 
a!., Dev. Genet. 22: 239-249, 1998). Briefly, embryos were fixed in 4% 
formaldehyde at 4"C overnight and embedded in paraffin. Tissue sections were 
hybridized with a '"^S-labelcd Csx/Nkx2.5 cRNA probe at 55"C, washed and treated 
10 with RNasc A. After emulsion autoradiography, sections were counter-stained 
with hematoxylin and eosin. 



Example 2: Isolation of hCsx/Nkx2.5 promoter and cardiac-specific elements 

We have cloned an approximately 20 kb-long genomic clone containing 
about 17 kb upstream fianking sequence, as well as the first exon and a part of 
15 intron of the hCsx/Nk\2.5, from a Lambda FlXll genomic library (Stratagene) 
using radiolabeled hCsx/Nkx2.5 cDNA probes. To narrow down the cardiac 
enhancer, various regions of the upstream flanking sequence from the 20 kb clone 
were fused to the hsp68 promoter-linked lacZ reporter, hspOS-ZczcZ (Kothary et al.. 
Development 105: 707-714 1989) (Figs. 3A-3C\ 6). The expression patterns of 
20 these constructs were tested using transgenic mice to define the enhancer region of 
the cardiac specificity. 

Each construct was linearized at immediate 5' end of the subcloned 
flanking sequence, phenokchloroform (1:1) and chloroform extracted one time 
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each, alcohol precipitated, and filtered using a 0.22 urn pore size microliter unit 
(Eppendorf 5 prime. Boulder, CO) after resuspendcd w itli deionized water. For 
transient transgene analysis, F,, embryos were collected at either E9.5 or El 0.5, 
formaldehyde fixed, PBS washed, and examined for the expression of the P- 
5 galactosidase. Genotyping was performed using PC'R on the chromosomal DNA 
fVom the yolk sacs. After the whole body embryos were incubated at 30"C for 12- 
14 hours in X-gal containing buffer, photographs of the /('/c Z positive embryos 
were taken. Consequently, the IcicZ stainmg positive embryos were paraffin 
embedded, sectioned, and counter stained using Nuclear Fast Red. Some of the 

10 transgenic mice were established in the whole embryos (at E7.5 and El 2.5) or in 
the hearts (three days after birth) (Figs. 7-9). 

The results of the transgenic study using the constructs described above 
allowed us to identify an approximately 7.5 kb-long enhancer sequence which 
showed expression patterns very similar to the niCs.x/NL\2.5 throughout the 

1 5 embryonic development; at E7.5 in the cardiac crescent, at E9.5 in the first 
branchial arch and the heart, and at El 2.5 in the four chambers of the heart 
including the outflow tract. There was a weak expression of the lacZ also in the 
stomach region at El 2.5. In the three day-old mouse, however, the enhancer 
activity of the 7.5 kb upstream flanking sequence of the l}Csx/Nkx2.5 results in a 

20 expression pattern than that observed for the endogneous mouse gene. The 7.5 kb 
hCsx/NLx2.5 cnhancer-hsp68 promoter-/^/cZ is active in the right ventricle, but not 
in the atria or the left ventricle. 

We sequenced the 7.5 kb enhancer region of /iCsx/Nk\2.5 (Figs. 4A and 
4B), and found regions of high sequence homology between the mouse and the 

25 human Csx/Nkx2.5 enhancers (Figs. 3A-3C, 5A, 5B). When the homologous 
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regions of the liCs.\/Nk.x2.5 eardiac enhancer (homology domains Al and A2) were 
separated from each other, neither part showed enhancer activity. When they were 
religated by removing the interv ening non-homologous sequence, however, the 
cardiac enhancer activ ity w as regained, implying that both of the homologous 
5 regions are required for the cardiac enhancer activity. In these homologous 

sequences, we found several putative transcription fiictor binding sites, which are 
important in cardiac development, such as GATA4, IV1EF2C, dHAND, SRE, E- 
box, Sp-1, TGF-b responsive element, and Csx/Nkx2.5 binding sites. 

Example 3: Use of cardiac-specific enhancer elements in the differentiation and 
1 0 and purification of cardiac cells from stem cells 

Stem cells {e.g.. embryonic stem (ES) cells and bone marrow stem cells) 

are capable of being induced into differentiating into cardiac cells. When mouse 

ES cells differentiate as cardiac cells, endogenous iiiCs.x/Nk.x2.5 is expressed. 

Thus it is highly likely that the enhancer elements of the present invention will also 
1 5 be active in stems cells of all types that differentiate as cardiac cells (Fig. 10). 

Optimizing induction ofcardiac cells 

As is described above, current methods for inducing the differentiation 
ofcardiac cells from stem cells have a low efficacy. Thus, in one aspect, the 
invention features a method for optimizing the efficacy ofcardiac cell induction. 
20 In one example, a population of stem cells arc transformed with a DNA vector that 
includes a reporter gene (e.g., a gene encoding GFP, P-galactosidase, or alkaline 
phosphatase) operably linked to a promoter and a cardiac-specific enhancer. 
Cardiac cell differentiation is then induced by one or more induction methods. 



Reporter gene expression is determined at the appropriate time: methods that result 
in a huge number or percentage ol~ reporter-positi\ e cells are identified as suitable 
methods for the induction of cardiac cells. Because reporter gene expression will 
precede the beating phenotypc currently used for scoring cardiac cell induction, the 
5 present method can be performed in a shorter time period and at a lower cost than 
prev ious methods. 

Selecting for or sorting out ccirdiac cells 

As the number of cardiac cells that differentiate from stem cells can very 
low, the invention also features a method of selecting for cardiac cells (and thereby 
10 eliminating non-cardiac cells). In one example of this method, a gene encoding a 
selectable marker (e.g., neomycin) is operably linked to a cardiac cell enhancer- 
promoter construct. This construct can be introduced into stem cells or, 
alternatively, used to produce a transgenic animal (e.g., a transgenic mouse or a 
transgenic pig), from which stem cells are then isolated. Following induction of 
1 5 cardiac cell differentiation, selection is performed for the appropriate time. During 
selection, non-cardiac cells die, resulting in an enrichment for cardiac cells. 

It is likely that some stem cells express Csx/Nk.\2.5 even prior to 
induction. These cells may have a greater capacity for producing cardiac cells. 
Accordingly, in another example, the selection step described above is performed 
20 prior to the induction step. 

In a related aspect, the invention features a method for separating 
cardiac cells or stem cells having a greater capacity for producing cardiac cells. In 
this example, the construct is as described above, except that the gene encoding a 
selectable marker is replaced with a reporter gene that allows for identification of 



4 

living cells (e.g., GFP). Reporter gene-positive cells can be identified and 
separated from reporter gene-negative cells using any appropriate method (e.g., 
Iliiorescence-activated cell sorting). Again, as in the previous method, the sorting 
step can be performed prior or subsequent to the induction step. 

5 Augmenting cardiac cell differentiation 

Several genes (e.g., BMP2/4^ the Wnt family of genes, dHAND, eHAND, 
MEF-2C\ (!ATA4, SRF, p300, IRX4 and C ',v.v/M'.v2. J ) have been implicated in 
cardiogenesis. Some of these cardiogenic genes arc expressed earlier than 
Csx/Nk\2.5, while others arc concomitant or later that Csx/Nkx2.5 . Regardless of 

10 their temporal expression, it is likely that expression of one of the foregoing genes 
under the control of a cardiac specific enhancer, such as those described herein, 
will result in an augmentation of cardiac cell differentiation (i.e., accelerated 
differentiation, an increase in the percentage of cardiac cells, or both). Therefore, 
the invention features methods for augmenting cardiac cell differentiation from a 

1 5 population of stem cells by expressing a cardiogenic gene under the control of a 
cardiac specific enhancer clement. Preferably, the method also includes a step in 
which cardiac cell differentiation is induced (e.g.. by introduction of 5- 
a/acytidine). 

Example 4: Use of cardiac-specific enhancer elements for gene therapy 
20 The use of gene therapy to introduce genes encoding therapeutic 

proteins into cardiac cells may be carried out /// vivo or ex vivo. Thus, in one 
aspect, the invention provides a method of modifying cardiac cells of a patient in 
vivo by inserting into the cardiac cells a DNA \ ector that includes the gene 



encoding a therapeutic protein operably linked to a cardiac-specific enhancer of 
the invention. Nucleic acid molecules are pro\ ided in solution or in any other 
pharmacologically suitable tbrm tor administration. Preferably, the DNA vectors 
are administered directly to the heart, but can also be administered systcmically. 
5 There are many delivery methods known in the art for enhancing the 

uptake of nucleic acids by cells. Useful delivery systems include the Sendai virus- 
liposomc delivery systems (see Rapaport and Shai, .1. Biol. Chem. 269: 15124- 
15131, 1994), cationic liposomes, polymeric delivery gels or matrices, porous 
balloon catheters (Shi ct al.. Circulation 90: 955-951, 1994; and Shi et al.. Gene 

10 Therapy 1:408-414, 1994), intraluminal pressure ( PCT/US96/()627 1 , herein 
incorporated by reference), retrovirus expression vectors, and the like. 

The use of liposomes as delivery vehicles is one particular method of 
interest. The liposomes fuse with the cells of the target site and deliver their 
contents intracellularly. As noted above, liposomes are maintained in contact with 

1 5 the cells for a time sufficient for fusion, using various means to maintain contact, 
such as isolation, binding agents, and the like. Liposomes may be prepared with 
purified proteins or peptides that mediate fusion of membranes, such as those of 
the Sendai virus or influenza virus, etc. The lipids may be any useful combination 
of known liposome forming lipids, including, for example, cationic lipids, such as 

20 phosphatidylcholine. The remaining lipid will normally be neutral lipids, such as 
cholesterol, phosphatidyl serine, phosphatidyl glycerol, and the like. 

Liposomes may be prepared by the procedure described by Kato et al. 
(J. Biol. Chem. 266: 3361, 1991 ). This method allows for the incorporation into 
the lumen high molecular weight molecules, particularly nucleic acids of one 

25 kilobase pair or more. In this way DNA vectors may be introduced into cells 



efficiently. 

In another aspect, cell populations (or t)rgans) can be removed from the 
patient or donor animal, modified c v vivo by insertion of the DNA vector, and re- 
implanted into the patient or transplanted into another recipient. Methods for 
5 grafting cells into cardiac tissue are described, for example, in PCT publication 
WO 98/54301 . In one example, stem cells are removed from a patient, their 
number expanded in culture, and then induced to differentiate as cardiac cells as 
described herein. At any time during the method, the DNA vector including the 
gene encoding the therapeutic gene may be introduced into the cells. The cells can 
1 0 transplanted into the patient or implanted into a different recipient of the same or 
different species. 

The donor species may be any species which is the same or different 
from the recipient species, and which is able to provide the appropriate cells (e.g., 
stem cells capable of differentiating into cardiac cells). The donor may be of a 

1 5 species which is allogenic or xenogenic to that of the recipient. Preferably, the 
recipient is a mammal, e.g., a primate. Most preferably the recipient is human. 
For human recipients, it is envisaged that human (i.e. allogenic) as well as pig (i.e. 
xenogenic) donors will be suitable, but any other mammalian species (e.g., bovine 
or non-human primate) may also be suitable as donors. For example, porcine 

20 aortic endothelial cells (PAEC), or the progenitor cells thereof, can be modified to 
express the gene encoding a therapeutic protein at effective levels, for grafting into 
a human recipient. 

Heterologous DNA can be inserted into the animal or an ancestor of the 
animal at the single-cell stage or early morula stage. The preferred stage is the 

25 single-cell stage although the process may be carried out between the two and 




eight cell stages. A transgenic animal can thereby be obtained, which will pass the 
heterologous DNA onto its offspring. 

Gene transfer can also be performed in allografts using e.\ vivo 
transduction of cells prior to transplantation, or for xenotransplantation, where 
5 porcine transgcnesis is established. Methods of preparing transgenic pigs are well 
known in the art, as discussed by Logan (C'urr. Opin. hnmunol. 12: 563-568, 2000) 
and the references cited therein. Any transgenic animal may be used in the present 
invention; pigs are particularly suitable because they are particularly amenable for 
xenotransplantation into a human recipient. Transgenic pigs may be produced by 

10 homologous recombination and other such techniques that destroy wild type gene 
function. By way of example, transgenic pigs may be produced utilizing 
homologous recombination techniques to produce a transgenic animal expressing 
the desired protein. 

For gene delivery, a variety of vectors or plasmids arc available (see, 

1 5 Maniatis et al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor, N.Y.; Ausubel et al.. Current Protocols in Molecular Biolog}'. Greene 
Publishing Associates, New York, V. I&2 1996; Harlow and Lane Antibodies, A 
Laboratory Manual, Cold Spring Harbor Laboratory, 1988, incorporated by 
reference herein). The common vectors described therein arc able to transiently or 

20 stably be maintained in smooth muscle cells, usually for a period of at least about 
one day, more usually for a period of at least about several days. 

Retroviral vectors (e.g., lentivirus), and in particular, replication-defective 
retroviral vectors lacking one or more of the gag, pol, and env sequences required 
for retroviral replication, are well-known to the art and may be used to transform 

25 endothelial or other mammalian cells. PA 3 1 7 cells or other producer cell lines 



producing helper-free viral vectors are well-described in the literature. A 
representative retroviral construct comprises at least one viral long terminal repeat 
and promoter sequences upstream ol the nucleotide sequence of the therapeutic 
substance and at least one viral long terminal repeat and polyadenylation signal 
5 downstream of the sequence. 

Vectors derived from adenoviruses, i.e. \ iruses that cause upper respiratory 
disease in humans and are present in latent infections in primates, arc also 
generally known in the art and useful in the present invention. The ability of 
adenoviruses to attach to cells at low ambient temperatures is an advantage in the 

10 transplant setting which can facilitate gene transfer during cold preservation of 
tissues or organs. Adenoviral mediated gene transfer into vessels or organs by 
means of transduction perfusion is also a means of modifying cells //; vivo. 

Prior to implantation into a recipient species, the treated cells may be 
screened for modified cells containing and expressing the construct. For this 

1 5 purpose, the DNA vector can also be provided with a second nucleotide sequence 
encoding an expression product that confers resistance to a selectable marker 
substance. Suitable selection markers for screening include the neo gene, 
conferring resistance to neomycin or the neomycin analog, G41 8. Although any 
mammalian cell can be targeted for insertion of the gene, the preferred cells for 

20 manipulation are stem cells. 

All publications cited herein are hereby incorporated by reference. 
What is claimed is: 



